Soft-decision a Priori Knowledge Interpolation for Robust Telephone Speaker Identification

نویسندگان

Yuan-Fu Liao

Jyh-Her Yang

Sin-Horng Chen

چکیده

Handsets which are not seen in the training phase (a.k.a unseen handsets) are main sources of performance degradation for speaker identification (SID) applications in telecommunication environments. To alleviate the problem, a soft-decision a priori knowledge interpolation (SD-AKI) method of handset characteristic estimation for handset mismatch-compensated SID is proposed in this paper. The idea of the SD-AKI method is to first collect a set of characteristics of seen handsets in the training phase, and to then estimate the characteristic of the unknown testing handset by interpolating the set of seen handset characteristics in the test phase. The estimated handset characteristic is then used to compensate for handset mismatch for robust SID. The SD-AKI method can be realized in both feature and model spaces. Experimental results on the handset TIMIT (HTIMIT) database showed that both the proposed featureand model-space SD-AKI schemes were more robust than the blind cepstral mean subtraction (CMS), feature warping (FW) methods and their hard-decision counterpart (HD-AKI) for both cases of all-handset and unseen-handset SID tests. It is therefore a promising robust SID method.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Unseen handset mismatch compensation based on a priori knowledge interpolation for robust speaker recognition

Unseen handset mismatch is the major source of performance degradation for speaker recognition in telecommunication environment since handset distortions are tightly coupled with speaker characteristics. In this paper, a soft-decision unseen handset characteristics estimation method based on a priori knowledge interpolation is proposed to decouple the characteristics of the unseen handset and s...

متن کامل

A robust aggregation operator for multi-criteria decision-making method with bipolar fuzzy soft environment

Molodtsov initiated soft set theory that provided a general mathematicalframework for handling with uncertainties in which we encounter the data by affix parameterized factor during the information analysis as differentiated to fuzzy as well as bipolar fuzzy set theory.The main object of this paper is to lay a foundation for providing a new application of bipolar fuzzy soft tool in ...

متن کامل

Rapid speaker adaptation by reference model interpolation

We present in this work a novel algorithm for fast speaker adaptation using only small amounts of adaptation data. It is motivated by the fact that a set of representative speakers can provide a priori knowledge to guide the estimation of a new speaker in the speaker-space. The proposed algorithm enables an a posteriori selection of reference models in the speakerspace as opposed to the a prior...

متن کامل

Robust text-independent speaker identification using Gaussian mixture speaker models

This paper introduces and motivates the use of Gaussian mixture models (CMM) for robust text-independent speaker identification. The individual Gaussian components of a GMM are shown to represent some general speaker-dependent spectral shapes that are efTective for modeling speaker identity. The focus of this work is on applications which require high identification rates using short utterance ...

متن کامل

Effect of Decision Rule on Speaker Recognition Performance

Speaker recognition from speech signal is still an ongoing research in forensics and biometrics area. Speaker recognition is the process to enable machine to recognize speaker's identity from their speech. The applications of speaker recognition technologies include access control system, security control for confidential information, and telephone banking. As a subset of speaker recognition, s...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2009

Soft-decision a Priori Knowledge Interpolation for Robust Telephone Speaker Identification

نویسندگان

چکیده

منابع مشابه

Unseen handset mismatch compensation based on a priori knowledge interpolation for robust speaker recognition

A robust aggregation operator for multi-criteria decision-making method with bipolar fuzzy soft environment

Rapid speaker adaptation by reference model interpolation

Robust text-independent speaker identification using Gaussian mixture speaker models

Effect of Decision Rule on Speaker Recognition Performance

عنوان ژورنال:

اشتراک گذاری